Expression System

The present invention relates to polypeptides which produce an
immune response which is protective against infection by
Bacillus anthracis, to methods of producing these, to
recombinant Escherichia coli cells useful in the methods, and
to nucleic acids and transformation vectors used.

Present systems for expressing PA for vaccine systems use
protease-deficient Bacillus subtilis as the expression host.
Although such systems are acceptable in terms of product
quantity and purity, there are significant drawbacks. Firstly,
regulatory authorities are generally unfamiliar with this host,
and licensing decisions may be delayed as a result. More
importantly, the currently used strains of Bacillus subtilis
produce thermostable spores which require the use of a dedicated
production plant.

WO 00/02522 describes in particular VEE virus replicons which
express PA or certain immunogenic fragments.

E. coli is well known as an expression system for a range of
human vaccines. While the ability to readily ferment E. coli to
very high cellular densities makes this bacterium an ideal host
for the expression of many proteins, previous attempts to
express and purify recombinant PA from E. coli cytosol have been
hindered by low protein yields and proteolytic degradation
(Singh et al., J. Biol. Chem. (1989) 264; 11099-11102, Vodkin et
al., Cell (1983) 34; 693-697 and Sharma et al., Protein Expr.
Purif. (1996), 7, 33-36).

A strategy for overexpressing PA as a stable, soluble protein in
the E. coli cytosol has been described recently (Willhite et
al., Protein and Peptide Letters, (1998), 5; 273-278). The
strategy adopted is one of adding an affinity tag sequence to
the terminus of PA, which allows a simple purification system.
A problem with this system is that it requires a further
downstream processing step in order to remove the tag before the
PA can be used.

Codon optimisation is a technique which is now well known and
used in the design of synthetic genes. There is a degree of
redundancy in the genetic code, in so far as most amino acids
are coded for by more than one codon sequence. Different
organisms utilise one or other of these different codons
preferentially. By optimising codons, it is generally expected
that expression levels of the particular protein will be
enhanced.
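
The principle can be illustrated with a minimal sketch. It assumes a
small, hypothetical map giving one chosen codon per amino acid; the
map below is illustrative only, is not taken from this specification,
and covers only a few residues. A real design would draw on a full
codon-usage table for the intended host.

```python
# Hypothetical, deliberately incomplete map of one chosen codon per
# amino acid (one-letter code). Illustrative assumption only.
PREFERRED_CODON = {
    "M": "ATG",
    "E": "GAA",
    "V": "GTG",
    "K": "AAA",
    "Q": "CAG",
    "N": "AAC",
    "R": "CGT",
    "L": "CTG",
}


def back_translate(protein: str, table: dict) -> str:
    """Build a coding sequence using the single chosen codon per residue."""
    try:
        return "".join(table[aa] for aa in protein)
    except KeyError as err:
        # Fail loudly rather than guess a codon for an unlisted residue.
        raise ValueError(f"no codon listed for residue {err.args[0]!r}") from None


if __name__ == "__main__":
    print(back_translate("MEVK", PREFERRED_CODON))
```

Because several codons encode the same amino acid, changing the map
alters the nucleotide sequence without altering the encoded protein.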

This is generally desirable, except where, as in the case of PA,
higher expression levels will result in proteolytic degradation
and/or cell toxicity. In such cases, elevating expression
levels might be counter-productive and result in significant
cell toxicity.

Surprisingly however, the applicants have found that this is not
the case in E. coli and that in this system, codon optimisation
results in expression of unexpectedly high levels of recombinant
PA, irrespective of the presence or absence of proteolytic
enzymes within the strain.

Furthermore, it would appear that expression of a protective
domain of PA does not inhibit expression in E. coli.

The crystal structure of native PA has been elucidated (Petosa
C., et al. Nature 385: 833-838, 1997) and shows that PA consists
of four distinct and functionally independent domains: domain 1,
divided into 1a, 1-167 amino acids and 1b, 168-258 amino acids;
domain 2, 259-487 amino acids; domain 3, 488-595 amino acids and
domain 4, 596-735 amino acids.
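
As a hedged illustration, these boundaries can be held in a small
lookup table. The sketch below assumes 1-based, inclusive residue
numbering and takes region 1b as 168-258, the boundary given for
region 1b elsewhere in this specification; the function names are
hypothetical.

```python
# Residue ranges (1-based, inclusive) for the PA domains as listed in
# the text. Region 1b is taken as 168-258 (assumption, see lead-in).
PA_DOMAINS = {
    "1a": (1, 167),
    "1b": (168, 258),
    "2": (259, 487),
    "3": (488, 595),
    "4": (596, 735),
}


def domain_length(name: str) -> int:
    """Number of residues in the named domain or region."""
    start, end = PA_DOMAINS[name]
    return end - start + 1


def domain_of(residue: int) -> str:
    """Return the domain or region containing a 1-based residue number."""
    for name, (start, end) in PA_DOMAINS.items():
        if start <= residue <= end:
            return name
    raise ValueError(f"residue {residue} lies outside 1-735")


def extract(sequence: str, name: str) -> str:
    """Slice a full-length sequence down to the named domain."""
    start, end = PA_DOMAINS[name]
    return sequence[start - 1:end]


if __name__ == "__main__":
    for name in PA_DOMAINS:
        print(name, domain_length(name))
```

With these ranges the regions tile residues 1 to 735 without gaps or
overlaps.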

The applicants have identified that certain domains appear to
produce surprisingly good protective effects when used in
isolation, in fusion proteins or in combination with each other.

According to the present invention there is provided an
immunogenic reagent which produces an immune response which is
protective against Bacillus anthracis, said reagent comprising
one or more polypeptides which together represent up to three
domains of the full length Protective Antigen (PA) of B.
anthracis or variants of these, and at least one of said domains
comprises domain 1 or domain 4 of PA or a variant thereof.

Specifically, the reagent will comprise mixtures of polypeptides
or fusion peptides wherein individual polypeptides comprise one
or more individual domains of PA.

In particular, the reagent comprises polypeptide(s) comprising
domain 1 or domain 4 of PA or a variant thereof, in a form other
than full length PA. Where present, domains are suitably
complete, in particular domain 1 is present in its entirety.

The term "polypeptide" used herein includes proteins and
peptides.

As used herein, the expression "variant" refers to sequences of
amino acids which differ from the basic sequence in that one or
more amino acids within the sequence are deleted or substituted
for other amino acids, but which still produce an immune
response which is protective against Bacillus anthracis. Amino
acid substitutions may be regarded as "conservative" where an
amino acid is replaced with a different amino acid with broadly
similar properties. Non-conservative substitutions are where
amino acids are replaced with amino acids of a different type.
Broadly speaking, fewer non-conservative substitutions will be
possible without altering the biological activity of the
polypeptide. Suitably variants will be at least 60% identical,
preferably at least 75% identical, and more preferably at least
90% identical to the PA sequence.

In particular, the identity of a particular variant sequence to
the PA sequence may be assessed using the multiple alignment
method described by Lipman and Pearson (Lipman, D.J. & Pearson,
W.R. (1985) Rapid and Sensitive Protein Similarity Searches,
Science, vol 227, pp 1435-1441). The "optimised" percentage score
should be calculated with the following parameters for the
Lipman-Pearson algorithm: ktup = 1, gap penalty = 4 and gap
penalty length = 12. The sequences for which similarity is to be
assessed should be used as the "test sequence" which means that
the base sequence for the comparison, (SEQ ID NO 1), should be
entered first into the algorithm.
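
The identity thresholds above can be illustrated with a simplified
sketch. It is not the Lipman-Pearson algorithm and does not use the
ktup or gap parameters; it only counts identical positions in two
sequences that are assumed to have been aligned already, with "-"
marking gaps. The choice of denominator (every column in which at
least one sequence has a residue) is likewise an assumption, and the
function names are hypothetical.

```python
def percent_identity(aligned_ref: str, aligned_query: str, gap: str = "-") -> float:
    """Percentage of compared alignment columns holding identical residues."""
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_ref, aligned_query):
        if a == gap and b == gap:
            continue  # column with no residue in either sequence
        compared += 1
        if a == b:
            matches += 1  # identical residues (both-gap columns were skipped)
    if compared == 0:
        raise ValueError("nothing to compare")
    return 100.0 * matches / compared


def identity_tier(pct: float) -> str:
    """Map a percentage onto the 60 / 75 / 90 tiers named in the text."""
    if pct >= 90:
        return "at least 90%"
    if pct >= 75:
        return "at least 75%"
    if pct >= 60:
        return "at least 60%"
    return "below 60%"


if __name__ == "__main__":
    pct = percent_identity("MEVKQENRLL", "MEVKQDNRLL")
    print(pct, identity_tier(pct))
```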

Preferably, the reagent of the invention includes a polypeptide
which has the sequence of domain 1 and/or domain 4 of wild-type
PA.

A particularly preferred embodiment of the invention comprises
domain 4 of the PA of B. anthracis.

These domains comprise the sequences shown in Table 1 below.

Table 1

Domain    Amino acids of full-length PA*
4         596-735
1         1-258

* These amino acid numbers refer to the sequence as shown in
Welkos et al., Gene 69 (1988) 267-300 and are illustrated
hereinafter as SEQ ID NOs 15 (Fig 4) and 3 (Fig 3) respectively.

Domain 1 comprises two regions, designated 1a and 1b. Region 1a
comprises amino acids 1-167 whereas region 1b is from amino acid
168-258. It appears that region 1a is important for the
production of a good protective response, and the full domain
may be preferred.

In a particularly preferred embodiment, a combination of domains
1 and 4, or protective regions thereof, is used as the
immunogenic reagent which gives rise to an immune response
protective against B. anthracis. This combination, for example
as a fusion peptide, may be expressed using the expression
system of the invention as outlined hereinafter.

When domain 1 is employed, it is suitably fused to domain 2 of 
the PA sequence, and may preferably be fused to domain 2 and 
domain 3. 

Such combinations and their use in prophylaxis or therapy form
a further aspect of the invention.

Suitably the domains described above are part of a fusion
protein, preferably with an N-terminal glutathione-S-transferase
protein (GST). The GST not only assists in the purification of
the protein, it may also provide an adjuvant effect, possibly as
a result of increasing the size.

The polypeptides of the invention are suitably prepared by
conventional methods. For example, they may be synthesised or
they may be prepared using recombinant DNA technology. In
particular, nucleic acids which encode said domains are included
in an expression vector, which is used to transform a host cell.
Culture of the host cell followed by isolation of the desired
polypeptide can then be carried out using conventional methods.

Nucleic acids, vectors and transformed cells used in these
methods form a further aspect of the invention.

Generally speaking, the host cells used will be those that are
conventionally used in the preparation of PA, such as Bacillus
subtilis.
The applicants have found surprisingly that the domains, either
in isolation or in combination, may be successfully expressed in
E. coli under certain conditions.

Thus, the present invention further provides a method for
producing an immunogenic polypeptide which produces an immune
response which is protective against B. anthracis, said method
comprising transforming an E. coli host with a nucleic acid
which encodes either (a) the protective antigen (PA) of Bacillus
anthracis or a variant thereof which can produce a protective
immune response, or (b) a polypeptide comprising at least one
protective domain of the protective antigen (PA) of Bacillus
anthracis or a variant thereof which can produce a protective
immune response as described above, culturing the transformed
host and recovering the polypeptide therefrom, provided that
where the polypeptide is the protective antigen (PA) of Bacillus
anthracis or a variant thereof which can produce a protective
immune response, the percentage of guanine and cytosine
residues within the said nucleic acid is in excess of 35%.

Using these options, high yields of product can be obtained 
using a favoured expression host. 

A table showing codons and the frequency with which they appear
in the genomes of Escherichia coli and Bacillus anthracis is
shown in Figure 1. It is clear that guanine and cytosine
appear much more frequently in E. coli than B. anthracis.
Analysis of the codon usage content reveals the following:



Species        1st letter    2nd letter    3rd letter    Total GC
               of codon GC   of codon GC   of codon GC   content
E. coli        58.50%        40.70%        54.90%        51.37%
B. anthracis   44.51%        31.07%        25.20%        33.59%



Thus it would appear that codons which are favoured by E. coli
are those which include guanine or cytosine where possible.
By increasing the percentage of guanine and cytosine
nucleotides in the sequence used to encode the immunogenic
protein over that normally found in the wild-type B. anthracis
gene, the codon usage will be such that expression in E. coli is
improved.
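
The GC percentages discussed above can be computed as in the
following sketch, which assumes an in-frame coding sequence given as
a plain string of A, C, G and T. The function names are hypothetical,
and the 35% default simply mirrors the "in excess of 35%" limit
stated in this specification.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in ("G", "C"))
    return 100.0 * gc / len(seq)


def gc_by_codon_position(cds: str) -> tuple:
    """GC percentage at the 1st, 2nd and 3rd position of each codon."""
    cds = cds.upper()
    if not cds or len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    # cds[i::3] collects every base occupying codon position i + 1.
    return tuple(gc_percent(cds[i::3]) for i in range(3))


def exceeds_gc_limit(cds: str, limit: float = 35.0) -> bool:
    """True when total GC content is strictly above the given limit."""
    return gc_percent(cds) > limit


if __name__ == "__main__":
    example = "ATGGAAGTGAAA"  # arbitrary in-frame example, not from the figures
    print(gc_by_codon_position(example), gc_percent(example))
```

Since every codon contributes one base to each position, the total GC
content of an in-frame sequence equals the mean of the three
per-position values, which is consistent with the table above.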

Suitably the percentage of guanine and cytosine residues
within the coding nucleic acid used in the invention, at least
where the polypeptide is the protective antigen (PA) of Bacillus
anthracis or a variant thereof which can produce a protective
immune response, is in excess of 40%, preferably in excess of
45% and most preferably from 50-52%.

High levels of expression of protective domains can be achieved
using the wild-type B. anthracis sequence encoding these
units. However, the yields may be improved further by
increasing the GC% of the nucleic acid as described above.

In a particular embodiment, the method involves the expression
of PA of B. anthracis.

Further according to the present invention, there is provided a
recombinant Escherichia coli cell which has been transformed
with a nucleic acid which encodes the protective antigen (PA) of
Bacillus anthracis or a variant thereof which can produce a
protective immune response, and wherein the percentage of
guanine and cytosine residues within the nucleic acid is in
excess of 35%.

As before, suitably the percentage of guanine and cytosine
residues within the coding nucleic acid is in excess of 40%,
preferably in excess of 45% and most preferably from 50-52%.

Suitably, the nucleic acid used to transform the E. coli cells
of the invention is a synthetic gene. In particular, the
nucleic acid is of SEQ ID NO 1 as shown in Figure 2 or a
modified form thereof.

The expression "modified form" refers to other nucleic acid
sequences which encode PA or fragments or variants thereof which
produce a protective immune response but which utilise some
different codons, provided the requirement for the percentage GC
content in accordance with the invention is met. Suitable
modified forms will be at least 80% similar, preferably 90%
similar and most preferably at least 95% similar to SEQ ID NO 1.
In particular, the nucleic acid comprises SEQ ID NO 1.

In an alternative embodiment, the invention provides a
recombinant Escherichia coli cell which has been transformed
with a nucleic acid which encodes a protective domain of the
protective antigen (PA) of Bacillus anthracis or a variant
thereof which can produce a protective immune response.

Preferably, the nucleic acid encodes domain 1 or domain 4 of
the PA of B. anthracis.

Further according to the invention there is provided a method of
producing an immunogenic polypeptide which produces an immune
response which is protective against B. anthracis, said method
comprising culturing a cell as described above and recovering
the desired polypeptide from the culture. Such methods are well
known in the art.

In yet a further aspect, the invention provides an E. coli
transformation vector comprising a nucleic acid which encodes
the protective antigen (PA) of Bacillus anthracis or a variant
thereof which can produce a protective immune response, and
wherein the percentage of guanine and cytosine residues within
the nucleic acid is in excess of 35%.

A still further aspect of the invention comprises an E. coli
transformation vector comprising a nucleic acid which encodes a
protective domain of the protective antigen (PA) of Bacillus
anthracis or a variant thereof which can produce a protective
immune response.

Suitable vectors for use in the transformation of E. coli are
well known in the art. For example, the T7 expression system
provides good expression levels. However, a particularly
preferred vector comprises pAG163 obtainable from Avecia (UK).

A nucleic acid of SEQ ID NO 1 or a variant thereof which encodes
PA and which has at least 35%, preferably at least 40%, more
preferably at least 45% and most preferably from 50-52% GC
content forms a further aspect of the invention.

If desired, PA, its variants, or domains can be expressed as a
fusion to another protein, for example a protein which provides
a different immunity, a protein which will assist in
purification of the product or a highly expressed protein (e.g.
thioredoxin, GST) to ensure good initiation of translation.

Optionally, additional systems such as T7 lysozyme will be added
to the expression system to improve the repression of the
system, although, in the case of the invention, the problems
associated with cell toxicity have not been noted.

Any suitable E. coli strain can be employed in the process of
the invention. Strains which are deficient in a number of
proteases (e.g. lon-, ompT-) are available, which would be
expected to minimise proteolysis. However, the applicants have
found that there is no need to use such strains to achieve good
yields of product and that other known strains such as K12
produce surprisingly high product yields.

Fermentation of the strain is generally carried out under
conventional conditions as would be understood in the art. For
example, fermentations can be carried out as batch cultures,
preferably in large shake flasks, using a complex medium
containing antibiotics for plasmid maintenance and with addition
of IPTG for induction.

Suitably cultures are harvested and cells stored at -20°C until 
required for purification. 

Suitable purification schemes for E. coli PA (or variant or
domain) expression can be adapted from those used in B. subtilis
expression. The individual purification steps to be used will
depend on the physical characteristics of recombinant PA.
Typically an ion exchange chromatography separation is carried
out under conditions which allow greatest differential binding
to the column followed by collection of fractions from a shallow
gradient. In some cases, a single chromatographic step may be
sufficient to obtain product of the desired specification.

Fractions can be analysed for the presence of the product using 
SDS PAGE or Western blotting as required. 

As illustrated hereinafter, the successful cloning and
expression of a panel of fusion proteins representing intact or
partial domains of rPA has been achieved. The immunogenicity and
protective efficacy of these fusion proteins against STI spore
challenge has been assessed in the A/J mouse model.

All the rPA domain proteins were immunogenic in A/J mice and
conferred at least partial protection against challenge compared
to the GST control immunised mice. The carrier protein, GST,
attached to the N-terminus of the domain proteins, did not
impair the immunogenicity of the fusion proteins either in vivo,
shown by the antibody response stimulated in immunised animals,
or in vitro, as the fusion proteins could be detected with
anti-rPA antisera after Western blotting, indicating that the GST
tag did not interfere with rPA epitope recognition. Immunisation
with the larger fusion proteins produced the highest titres. In
particular, mice immunised with the full length GST 1-4 fusion
protein produced a mean serum anti-rPA concentration
approximately eight times that of the rPA immunised group
(Figure 5). Immunisation of mice with rPA domains 1-4 with the
GST cleaved off produced titres of approximately one half those
produced by immunisation with the fusion protein. Why this
fusion protein should be much more immunogenic is unclear. It is
possible that the increased size of this protein may have an
adjuvantising effect on the immune effector cells. It did not
stimulate this response to the same extent in the other fusion
proteins and any adjuvantising effect of the GST tag did not
enhance protection against challenge as the cleaved proteins
were similarly protective to their fusion protein counterparts.

Despite having good anti-rPA titres, some breakthrough in
protection at the lower challenge level of 10^2 MLDs occurred in
the groups immunised with GST1, cleaved 1, GST1b-2, GST1b-3 and
GST1-3, and immunisation with these proteins did not prolong the
survival time of those mice that did succumb to challenge,
compared with the GST control immunised mice. This suggests
that the immune response had not been appropriately primed by
these proteins to achieve full resistance to the infection. As
has been shown in other studies in mice and guinea pigs (Little
S.F. et al., 1986. Infect. Immun. 52: 509-512, Turnbull P.C.B.,
et al., 1986. Infect. Immun. 52: 356-363) there is no precise
correlation between antibody titre to PA and protection against
challenge. However a certain threshold of antibody is required
for protection (Cohen S. et al., 2000. Infect. Immun. 68: 4549-
4558), suggesting that cell mediated components of the immune
response are also required to be stimulated for protection
(Williamson 1989).

GST1, GST1b-2 and GST1-2 were the least stable fusion proteins
produced, as shown by SDS-PAGE and Western blotting results,
possibly due to the proteins being more susceptible to
degradation in the absence of domain 3, and this instability may
have resulted in the loss of protective epitopes.

The structural conformation of the proteins may also be
important for stimulating a protective immune response. The
removal of domain 1a from the fusion proteins gave both reduced
antibody titres and less protection against challenge, when
compared to their intact counterparts GST1-2 and GST1-3.
Similarly, mice immunised with GST 1 alone were partially
protected against challenge, but when combined with domain 2, as
the GST1-2 fusion protein, full protection was seen at the 10^2
MLD challenge level. However the immune response stimulated by
immunisation with the GST1-2 fusion protein was insufficient to
provide full protection against the higher 10^3 MLDs challenge
level, which again could be due to the loss of protective
epitopes due to degradation of the protein.

All groups immunised with truncates containing domain 4,
including GST 4 alone, cleaved 4 alone and a mixture of two
individually expressed domains, GST 1 and GST 4, were fully
protected against challenge with 10^3 MLDs of STI spores (Table
1). Brossier et al. showed a decrease in protection in mice
immunised with a mutated strain of B. anthracis that expressed PA
without domain 4 (Brossier F., et al. 2000. Infect. Immun. 68:
1781-1785) and this was confirmed in this study, where
immunisation with GST 1-3 resulted in breakthrough in protection
despite good antibody titres. These data indicate that domain 4
is the immunodominant sub-unit of PA. Domain 4 represents the
139 amino acids of the carboxy terminus of the PA polypeptide.
It contains the host cell receptor binding region (Little S.F.
et al., 1996. Microbiology 142: 707-715), identified as being in
and near a small loop located between amino acid residues 679-
693 (Varughese M., et al. 1999. Infect. Immun. 67: 1860-1865).






WO 02/04*46 PCT/GB01/03065 


Therefore it is essential for host cell intoxication, as it has
been demonstrated that forms of PA expressed containing
mutations (Varughese 1999 supra.) or deletions (Brossier 1999
supra.) in the region of domain 4 are non-toxic. The crystal
structure of PA shows domain 4, and in particular a 19 amino
acid loop of the domain (703-722), to be more exposed than the
other three domains which are closely associated with each other
(Petosa 1997 supra.). This structural arrangement may make
domain 4 the most prominent epitope for recognition by immune
effector cells, and therefore fusion proteins containing domain
4 would elicit the most protective immune response.

This investigation has further elucidated the role of PA in the
stimulation of a protective immune response, demonstrating that
protection against anthrax infection can be attributed to
individual domains of PA.

The invention will now be particularly described by way of 
example, with reference to the accompanying drawings in which: 

Figure 1 is a table of codon frequencies found within E. coli
and B. anthracis;

Figure 2 shows the sequence of a nucleic acid according to the
invention, which encodes PA of B. anthracis, as published by
Welkos et al. supra;

Figure 3 shows SEQ ID NOs 3-14, which are amino acid and DNA
sequences used to encode various domains or combinations of
domains of PA as detailed hereinafter;

Figure 4 shows SEQ ID NOs 15-16 which are the amino acid and DNA
sequences of domain 4 of PA respectively; and

Figure 5 is a table showing anti-rPA IgG concentration, 37
days post primary immunisation, from A/J mice immunised
intramuscularly on days 1 and 28 with 10µg of fusion protein
including the PA fragment; results shown are mean ± sem of
samples taken from 5 mice per treatment group.

Example 1

Investigation into expression in E. coli
rPA expression plasmid pAG163::rPA has been modified to
substitute a KmR marker for the original TcR gene. This plasmid
has been transformed into expression host E. coli BLR (DE3) and
expression level and solubility assessed. This strain is
deficient in the intracellular protease La (lon gene product)
and the outer membrane protease OmpT.

Expression studies did not however show any improvement in the
accumulation of soluble protein in this strain compared to lon+
K12 host strains (i.e. accumulation is prevented due to
excessive proteolysis). It was concluded that any intracellular
proteolysis of rPA was not due to the action of La protease.

Example 2

Fermentation analysis

Further analysis was made of the fermentation carried out using
the K12 strain UT5600 (DE3) pAG163::rPA.

It was found that the rPA in this culture was divided between
the soluble and insoluble fractions (estimated 350mg/L
insoluble, 650mg/L full length soluble). The conditions used
(37°C, 1mM IPTG for induction) had not yielded any detectable
soluble rPA in shake flask cultures and, given the results
described in Example 1 above, the presence of a large amount of
soluble rPA is surprising. Nevertheless it appears that
manipulation of the fermentation, induction and point of harvest
may allow stable accumulation of rPA in E. coli K12 expression
strains.

Example 3

A sample of rPA was produced from material initially isolated as
insoluble inclusion bodies from the UT5600 (DE3) pAG163::rPA
fermentation. Inclusion bodies were washed twice with 25mM
Tris-HCl pH9 and once with the same buffer + 2M urea. They were
then solubilized in buffer + 8M urea and debris pelleted. Urea
was removed by dilution into 25mM Tris-HCl pH8 and static
incubation overnight at 4°C. Diluted sample was applied to a Q
sepharose column and protein eluted with a NaCl gradient.
Fractions containing highest purity rPA were pooled, aliquoted
and frozen at -70°C. Testing of this sample using 4-12% MES-SDS
NuPAGE gel against a known standard indicated that it is high
purity and low in endotoxin contamination.

Example 4

Further Characterisation of the Product

N-terminal sequencing of the product showed that the N-terminal
sequence consisted of

MEVKQENRLL (SEQ ID NO 2)

This confirmed that the product was as expected with initiator
methionine left on.

The material was found to react in Western blot; MALDI-MS on
the sample indicated a mass of approx 82 700 (compared to
expected mass of 82 915). Given the high molecular mass and
distance from mass standard used (66 kDa), this is considered an
indication that material does not have significant truncation
but does not rule out microheterogeneity within the sample.
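
The size of the discrepancy between the observed and expected masses
can be checked with simple arithmetic, sketched below. The figure of
110 Da for an average amino acid residue is a common rule of thumb
and an assumption here, not a value taken from this specification;
the function names are hypothetical.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb assumption, not from the text


def mass_difference(observed: float, expected: float) -> float:
    """Observed minus expected mass, in the units supplied."""
    return observed - expected


def relative_deviation_percent(observed: float, expected: float) -> float:
    """Signed deviation of the observed mass as a percentage of expected."""
    return 100.0 * (observed - expected) / expected


def residue_equivalents(observed: float, expected: float) -> float:
    """Rough number of average residues the absolute difference represents."""
    return abs(observed - expected) / AVERAGE_RESIDUE_MASS_DA


if __name__ == "__main__":
    obs, exp = 82700, 82915  # values quoted in Example 4
    print(mass_difference(obs, exp))
    print(round(relative_deviation_percent(obs, exp), 2))
    print(round(residue_equivalents(obs, exp), 1))
```

On these figures the observed mass lies within about 0.3% of the
expected mass, a difference on the order of two average residues,
which is in keeping with the conclusion that no significant
truncation has occurred.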

Example 5

Testing of individual domains of PA

Individual domains of PA were produced as recombinant proteins
in E. coli as fusion proteins with the carrier protein
glutathione-S-transferase (GST), using the Pharmacia pGEX-6P-3
expression system. The sequences of the various domains and
the DNA sequence used to encode them are attached herewith as
Figure 3. The respective amino acid and DNA sequences are
provided in Table 2 below.

These fusion proteins were used to immunise A/J mice (Harlan
Olac) intra-muscularly with 10µg of the respective fusion
protein adsorbed to 20% v/v alhydrogel in a total volume of
100µl.

Animals were immunised on two occasions and their development of
protective immunity was determined by challenge with spores of
B. anthracis (STI strain) at the indicated dose levels. The table
below shows survivors at 14 days post-challenge.

                                   Challenge level in spores/mouse
Domains      Amino acid  DNA SEQ  5x10*     9x10*     9x10 s *  1x10*  5x10 e
             SEQ ID NO   ID NO
GST-1        3           4        4/4       3/5
GST-1+2      5           6        4/4; 5/5  4/5; 5/5
GST-1b+2     7           8        2/5       1/5
GST-1b+2+3   9           10       2/5       3/5
GST-1+2+3    11          12       Nd        4/5       3/5
GST-1+2+3+4  13          14       Nd        5/5       5/5
1+2+3+4      13          14       Nd        Nd                  5/5    5/5



The data show that a combination of all 4 domains of PA,
whether presented as a fusion protein with GST or not, was
protective up to a high challenge level. Removal of domain 4,
leaving 1+2+3, resulted in breakthrough at the highest challenge
level tested, 9x10 s. Domains 1+2 were as protective as a
combination of domains 1+2+3 at 9x10* spores. However, removal
of domain 1a to leave a GST fusion with domains 1b+2 resulted
in breakthrough in protection at the highest challenge level
tested (9x10 a), which was only slightly improved by adding
domain 3.

The data indicate that the protective immunity induced by PA
can be attributed to individual domains (intact domain 1 and
domain 4) or to combinations of domains taken as permutations
from all 4 domains.

The amino acid sequence and a DNA coding sequence for domain 4
are shown in Figure 4 as SEQ ID NOs 15 and 16 respectively.

Example 6 

Further Testing of domains as vaccines

DNA encoding the PA domains, amino acids 1-259, 168-488, 1-488,
168-596, 1-596, 260-735, 489-735, 597-735 and 1-735 (truncates
GST1, GST1b-2, GST1-2, GST1b-3, GST1-3, GST2-4, GST3-4, GST4 and
GST1-4 respectively) was PCR amplified from B. anthracis Sterne
DNA and cloned into the XhoI/BamHI sites of the expression
vector pGEX-6P-3 (Amersham-Pharmacia) downstream and in frame of
the lac promoter. Proteins produced using this system were
expressed as fusion proteins with an N-terminal glutathione-S-
transferase protein (GST). Recombinant plasmid DNA harbouring
the DNA encoding the PA domains was then transformed into E.
coli BL21 for protein expression studies.
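
For reference, the truncates listed above can be tabulated as in the
following sketch, which uses the residue ranges exactly as quoted in
this Example (1-based, inclusive). The computed coding length covers
the PA portion only and assumes three nucleotides per residue with no
stop codon, GST or vector sequence included; the function names are
hypothetical.

```python
# Truncate name -> (first residue, last residue) of PA, as quoted above.
TRUNCATES = {
    "GST1": (1, 259),
    "GST1b-2": (168, 488),
    "GST1-2": (1, 488),
    "GST1b-3": (168, 596),
    "GST1-3": (1, 596),
    "GST2-4": (260, 735),
    "GST3-4": (489, 735),
    "GST4": (597, 735),
    "GST1-4": (1, 735),
}


def pa_length_aa(name: str) -> int:
    """Number of PA residues carried by the named truncate."""
    start, end = TRUNCATES[name]
    return end - start + 1


def pa_coding_length_bp(name: str) -> int:
    """Length in base pairs of the PA coding region alone (3 nt per residue)."""
    return 3 * pa_length_aa(name)


if __name__ == "__main__":
    for name in TRUNCATES:
        print(f"{name:8s} {pa_length_aa(name):4d} aa {pa_coding_length_bp(name):5d} bp")
```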

E. coli BL21 harbouring recombinant pGEX-6P-3 plasmids were
cultured in L-broth containing 50µg/ml ampicillin, 30µg/ml
chloramphenicol and 1% w/v glucose. Cultures were incubated
with shaking (170 rev min^-1) at 30°C to an A600nm of 0.4, prior
to induction with 0.5mM IPTG. Cultures were incubated for a
further 4 hours, followed by harvesting by centrifugation at 10
000 rpm for 15 minutes.

Initial extraction of the PA truncate-fusion proteins indicated
that they were produced as inclusion bodies. Cell pellets were
resuspended in phosphate buffered saline (PBS) and sonicated
4x20 seconds in an iced water bath. The suspension was
centrifuged at 15 000 rpm for 15 minutes and cell pellets were
then urea extracted, by suspension in 8M urea with stirring at
room temperature for 1 hour. The suspension was centrifuged for
15 minutes at 15 000 rpm and the supernatant dialysed against
100mM Tris pH 8 containing 400mM L-arginine and 0.1mM EDTA,
prior to dialysis into PBS.

The successful refolding of the PA truncate-fusion proteins
allowed them to be purified on a glutathione Sepharose CL-4B
affinity column. All extracts (with the exception of truncate
GST1b-2, amino acid residues 168-487) were applied to a 15 ml
glutathione Sepharose CL-4B column (Amersham-Pharmacia),
previously equilibrated with PBS, and incubated, with rolling,
overnight at 4°C. The column was washed with PBS and the fusion
protein eluted with 50mM Tris pH7, containing 150mM NaCl, 1mM
EDTA and 20mM reduced glutathione. Fractions containing the PA
truncates, identified by SDS-PAGE analysis, were pooled and
dialysed against PBS. Protein concentration was determined
using BCA (Perbio).

However, truncate GST1b-2 could not be eluted from the
glutathione Sepharose CL-4B affinity column using reduced
glutathione and was therefore purified using ion exchange
chromatography. Specifically, truncate GST1b-2 was dialysed
against 20 mM Tris pH 8, prior to loading onto a HiTrap Q column
(Amersham-Pharmacia), equilibrated with the same buffer. Fusion
protein was eluted with an increasing NaCl gradient of 0-1 M in
20 mM Tris pH 8. Fractions containing the GST-protein were
pooled, concentrated and loaded onto a HiLoad 26/60 Superdex 200
gel filtration column (Amersham-Pharmacia), previously
equilibrated with PBS. Fractions containing fusion protein were
pooled and the protein concentration determined by BCA (Perbio).
Yields were between 1 and 43 mg per litre of culture.

The molecular weight of the fragments and their recognition by
antibodies to PA was confirmed using SDS-PAGE and Western
blotting. Analysis of the rPA truncates by SDS-PAGE and Western
blotting showed protein bands of the expected sizes. Some
degradation in all of the rPA truncates investigated was
apparent, showing similarity with recombinant PA expressed in B.
subtilis. The rPA truncates GST1, GST1b-2 and GST1-2 were
particularly susceptible to degradation in the absence of domain
3. This has similarly been reported for rPA constructs
containing mutations in domain 3 that could not be purified
from B. anthracis culture supernatants (Brossier 1999),
indicating that domain 3 may stabilise domains 1 and 2.

Female, specific pathogen free A/J mice (Harlan UK) were used in
this study as these are a consistent model for anthrax infection
(Welkos 1986). Mice were age matched and seven weeks of age at
the start of the study.

A/J mice were immunised on days 1 and 28 of the study with 10 µg
of fusion protein adsorbed to 20 µl of 1.3% v/v Alhydrogel (HCI
Biosector, Denmark) in a total volume of 100 µl of PBS. Groups
immunised with rPA from B. subtilis (Miller 1998), with
recombinant GST control protein, or with fusion proteins encoding
domains 1, 4 and 1-4 which had the GST tag removed, were also
included. Immunising doses were administered intramuscularly
into two sites on the hind legs. Mice were blood sampled 37 days
post primary immunisation for serum antibody analysis by enzyme
linked immunosorbent assay (ELISA).

Microtitre plates (Immulon 2, Dynex Technologies) were coated
overnight at 4°C with 5 µg/ml rPA, expressed from B. subtilis
(Miller 1998), in PBS, except for two rows per plate which were
coated with 5 µg/ml anti-mouse Fab (Sigma, Poole, Dorset).
Plates were washed with PBS containing 1% v/v Tween 20 (PBS-T)
and blocked with 5% w/v skimmed milk powder in PBS (blotto) for
2 hours at 37°C. Serum, double-diluted in 1% blotto, was added






WO 02/04646 PCT/GB03/03065 


to the rPA coated wells and was assayed in duplicate together
with murine IgG standard (Sigma) added to the anti-Fab coated
wells, and incubated overnight at 4°C. After washing, horseradish
peroxidase conjugated goat anti-mouse IgG (Southern
Biotechnology Associates Inc.), diluted 1 in 2000 in PBS, was
added to all wells and incubated for 1 hour at 37°C. Plates
were washed again before addition of the substrate 2,2'-azinobis
(3-ethylbenzthiazoline-sulfonic acid) (1.09 mM ABTS, Sigma).
After 20 minutes incubation at room temperature, the absorbance
of the wells at 414 nm was measured (Titertek Multiskan, ICN
Flow). Standard curves were calculated using Titersoft version
3.1c software. Titres were presented as µg IgG per ml serum and
group means ± standard error of the mean (sem) were calculated.
The results are shown in Figure 5.
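
The group summary described above (mean and standard error of the mean) can be sketched as follows. This is only an illustration: the function name mean_and_sem and the example titres are hypothetical, and the sketch assumes the sem is taken as the sample standard deviation divided by the square root of the group size, which the text does not state explicitly.

```python
import math
import statistics


def mean_and_sem(titres):
    """Return (mean, standard error of the mean) for titres in ug IgG per ml."""
    n = len(titres)
    if n < 2:
        raise ValueError("need at least two values to estimate the sem")
    mean = statistics.mean(titres)
    # Assumption: sem = sample standard deviation / sqrt(n)
    sem = statistics.stdev(titres) / math.sqrt(n)
    return mean, sem


# Hypothetical titres for one group of five mice (not data from this study)
example_titres = [100.0, 120.0, 80.0, 110.0, 90.0]
example_mean, example_sem = mean_and_sem(example_titres)
print(f"{example_mean:.1f} +/- {example_sem:.1f} ug IgG per ml")
```

The same calculation would apply to the mean time to death values reported with the challenge results.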

All the rPA truncates produced were immunogenic and stimulated
mean serum anti-rPA IgG concentrations in the A/J mice ranging
from 6 µg per ml, for the GST1b-2 truncate immunised group, to
1488 µg per ml, in the GST1-4 truncate immunised group (Figure
5). The GST control immunised mice had no detectable antibodies
to rPA.

Mice were challenged with B. anthracis STI spores on day 70 of
the immunisation regimen. Sufficient STI spores for the
challenge were removed from stock, washed in sterile distilled
water and resuspended in PBS to a concentration of 1 × 10⁷ and
1 × 10⁶ spores per ml. Mice were challenged intraperitoneally with
0.1 ml volumes containing 1 × 10⁶ and 1 × 10⁵ spores per mouse,
respectively, and were monitored for 14 days post challenge to
determine their protected status. Humane end-points were
strictly observed so that any animal displaying a collection of
clinical signs which together indicated it had a lethal
infection was culled. The numbers of immunised mice which
survived 14 days post challenge are shown in Table 3.
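
The challenge doses follow from the spore concentration and the injected volume. The short sketch below reproduces that arithmetic, reading the two suspensions as 1 x 10^7 and 1 x 10^6 spores per ml and taking 1 MLD as approximately 1 x 10^3 STI spores (the Table 3 footnote). The constant and function names are illustrative only.

```python
SPORES_PER_MLD = 1e3       # Table 3 footnote: 1 MLD = approx. 1 x 10^3 STI spores
INJECTION_VOLUME_ML = 0.1  # volume given intraperitoneally to each mouse


def challenge_dose(spores_per_ml, volume_ml=INJECTION_VOLUME_ML):
    """Return (spores per mouse, dose in MLDs) for one injection."""
    if spores_per_ml <= 0 or volume_ml <= 0:
        raise ValueError("concentration and volume must be positive")
    spores = spores_per_ml * volume_ml
    return spores, spores / SPORES_PER_MLD


for concentration in (1e7, 1e6):
    spores, mlds = challenge_dose(concentration)
    print(f"{concentration:.0e} spores/ml -> {spores:.0e} spores/mouse, {mlds:.0e} MLDs")
```

Under these assumptions the two suspensions correspond to the two challenge levels of Table 3.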
Table 3

                 Challenge level: survivors/no. challenged (%)
Domain           10² MLDs           10³ MLDs

GST 1            3/5 (60)           1/5 (20)
GST 1b-2         1/5 (20)           nd
GST 1-2          5/5 (100)          3/5 (60)
GST 1b-3         3/5 (60)           nd
GST 1-3          4/5 (80)           nd
GST 1-4          nd                 5/5 (100)
GST 2-4          nd                 5/5 (100)
GST 3-4          nd                 5/5 (100)
GST 4            5/5 (100)          5/5 (100)
GST 1 + GST 4    nd                 5/5 (100)
Cleaved 1        1/5 (20)           2/5
Cleaved 4        5/5 (100)          5/5
Cleaved 1-4      nd                 5/5
rPA              nd                 4/4 (100)
control          0/5 (0)            0/5 (0)

1 MLD = approx. 1 × 10³ STI spores
nd = not done
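
The percentages in Table 3 are simply the survivors divided by the number challenged. A minimal sketch of that conversion is given below; the helper name survival_percent is hypothetical, and 'nd' entries are returned as None rather than as a number.

```python
def survival_percent(entry):
    """Convert a 'survivors/challenged' entry such as '3/5' to a percentage.

    Entries marked 'nd' (not done) give None.
    """
    entry = entry.strip()
    if entry.lower() == "nd":
        return None
    survivors_text, challenged_text = entry.split("/")
    survivors, challenged = int(survivors_text), int(challenged_text)
    if challenged <= 0 or not 0 <= survivors <= challenged:
        raise ValueError(f"implausible entry: {entry!r}")
    return 100 * survivors / challenged


# Entries of the kind found in Table 3
for cell in ("3/5", "4/4", "0/5", "nd"):
    print(cell, "->", survival_percent(cell))
```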



The groups challenged with 10³ MLDs of STI spores were all
fully protected except for the GST1, GST1-2 and cleaved 1
immunised groups, in which there was some breakthrough in
protection, and the control group immunised with GST only, which
all succumbed to infection with a mean time to death (MTTD) of
2.4 ± 0.2 days. At the lower challenge level of 10² MLDs the
GST1-2, GST4 and cleaved 4 immunised groups were all fully
protected, but there was some breakthrough in protection in the
other groups. The mice that died in these groups had a MTTD of
4.5 ± 0.2 days, which was not significantly different from the
GST control immunised group, which all died with a MTTD of 4 ±
0.4 days.
Claims

1. An immunogenic reagent which produces an immune response
which is protective against Bacillus anthracis, said reagent
comprising one or more polypeptides which together represent up
to three domains of the full length Protective Antigen (PA) of
B. anthracis or variants of these, and at least one of said
domains comprises domain 1 or domain 4 of PA or a variant
thereof.

2. An immunogenic reagent according to claim 1 which
comprises the sequence of domain 1 and/or domain 4 of wild-type
PA.

3. An immunogenic reagent according to claim 1 or claim 2
which comprises domain 4 of the PA of B. anthracis.

4. An immunogenic reagent according to any one of the
preceding claims which comprises a combination of domains 1 and
4 or protective regions thereof.

5. An immunogenic reagent according to claim 4 wherein said
domains are present in the form of a fusion polypeptide.

6. An immunogenic reagent according to claim 5 which
comprises domain 1 fused to domain 2 of the PA sequence.



7. An immunogenic reagent according to claim 6 which is fused
to domain 3 of the PA sequence.

8. An immunogenic reagent according to claim 4 which
comprises a mixture of polypeptides, one of which comprises
domain 1 and one of which comprises domain 4 of the PA sequence.

9. An immunogenic reagent according to any one of the
preceding claims wherein a polypeptide is fused to a further
polypeptide.

10. An immunogenic reagent according to claim 9 wherein said
further peptide is glutathione-S-transferase (GST).

11. A nucleic acid which encodes a polypeptide of an
immunogenic reagent according to any one of the preceding
claims.

12. An expression vector comprising a nucleic acid according
to claim 11.

13. A cell transformed with a vector according to claim 12.

14. A method for producing an immunogenic polypeptide which
produces an immune response which is protective against B.
anthracis, said method comprising transforming an E. coli host
with a nucleic acid which encodes either (a) the protective
antigen (PA) of Bacillus anthracis or a variant thereof which
can produce a protective immune response, or (b) a protective
domain of the protective antigen (PA) of Bacillus anthracis or a
variant thereof which can produce a protective immune response,
culturing the transformed host and recovering the polypeptide
therefrom, provided that where the polypeptide is the protective
antigen (PA) of Bacillus anthracis or a variant thereof which can
produce a protective immune response, the percentage of
guanine and cytosine residues within the said nucleic acid is
in excess of 35%.

15. A method according to claim 14 wherein the said nucleic
acid encodes the protective antigen (PA) of Bacillus anthracis
or a variant thereof which can produce a protective immune
response.

16. A method according to claim 15 wherein the percentage of
guanine and cytosine residues within the said nucleic acid is
in excess of 45%.

17. A method according to claim 16 wherein the percentage of
guanine and cytosine residues within the said nucleic acid is
from 50-52%.

18. A method according to claim 14 wherein the said nucleic
acid encodes a protective domain of the protective antigen (PA)
of Bacillus anthracis or a variant thereof which can produce a
protective immune response.

19. A method according to claim 18 wherein the domain is
domain 1 and/or domain 4 of PA of B. anthracis.

20. A recombinant Escherichia coli cell which has been
transformed with a nucleic acid which encodes the protective
antigen (PA) of Bacillus anthracis or a variant thereof which
can produce a protective immune response, and wherein the
percentage of guanine and cytosine residues within the nucleic
acid is in excess of 35%.

21. A recombinant Escherichia coli cell according to claim 20
wherein the percentage of guanine and cytosine residues within
the said nucleic acid is in excess of 45%.

22. A recombinant Escherichia coli cell according to claim 21
wherein the percentage of guanine and cytosine residues within
the said nucleic acid is from 50%-52%.

23. A recombinant E. coli cell according to claim 20 wherein
said nucleic acid is of SEQ ID NO 1 as shown in Figure 2 or a
modified form thereof.

24. A recombinant E. coli cell according to claim 23 wherein
said nucleic acid is of SEQ ID NO 1.

25. A recombinant Escherichia coli cell which has been
transformed with a nucleic acid which encodes a protective
domain of the protective antigen (PA) of Bacillus anthracis or a
variant thereof which can produce a protective immune response.

26. A recombinant cell according to claim 25 wherein the
nucleic acid encodes domain 1 or domain 4 of PA of B. anthracis.

27. A method of producing a polypeptide which produces an
immune response which is protective against B. anthracis, said
method comprising culturing a cell according to any one of
claims 20 to 26 and recovering the protective polypeptide from
the culture.

28. An E. coli transformation vector comprising a nucleic acid
which encodes the protective antigen (PA) of Bacillus anthracis
or a variant thereof which can produce a protective immune
response, and wherein the percentage of guanine and cytosine
residues within the nucleic acid is in excess of 35%.

29. An E. coli transformation vector comprising a nucleic acid
which encodes a protective domain of the protective antigen (PA)
of Bacillus anthracis or a variant thereof which can produce a
protective immune response.

30. A nucleic acid of SEQ ID NO 1 or a modified form thereof
which encodes PA or a variant thereof which produces a
protective immune response and which has at least 35% GC
content.

31. A nucleic acid according to claim 30 which is at least 90%
identical to SEQ ID NO 1.

32. A nucleic acid according to claim 31 which comprises SEQ
ID NO 1.

34. A method of preventing or treating infection by B.
anthracis, said method comprising administering to a mammal in
need thereof, a sufficient amount of an immunogenic reagent
according to any one of claims 1 to 10.

35. The use of an immunogenic reagent according to any one of
claims 1 to 10 in the preparation of a medicament for the
prophylaxis or treatment of B. anthracis infection.
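
Several of the claims above set a limit on the percentage of G and C residues in the nucleic acid (in excess of 35%, in excess of 45%, or from 50-52%). As a minimal sketch, assuming the sequence is available as a plain string of nucleotide letters, that percentage could be computed as shown below. The function names gc_percent and exceeds_gc_threshold are illustrative and do not come from the specification.

```python
def gc_percent(sequence):
    """Return the percentage of G and C residues in a DNA or RNA sequence.

    Whitespace (for example the block spacing used in sequence listings)
    is ignored and case does not matter.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    unexpected = set(seq) - set("ACGTU")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def exceeds_gc_threshold(sequence, threshold_percent):
    """True if the GC percentage is strictly greater than the threshold."""
    return gc_percent(sequence) > threshold_percent


# Hypothetical short fragments, used only to show the call pattern
print(gc_percent("ATGC ATGC"))
print(exceeds_gc_threshold("GGCCAT", 45.0))
```

A check of this kind rejects any character outside A, C, G, T and U, so a listing would need to be free of transcription errors before it is applied.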
Escherichia coli [gbbct]: 14457 CDS's (4541860 codons)
Fields: [triplet] [frequency: per thousand] ([number])

UUU 22.0(100128)  UCU  9.3( 42367)  UAU 16.7( 75774)  UGU  5.2( 23461)
UUC 16.5( 74885)  UCC  8.9( 40365)  UAC 12.3( 55847)  UGC  6.3( 28747)
UUA 13.8( 62023)  UCA  7.9( 35837)  UAA  2.0(  9006)  UGA  1.0(  4428)
UUG 13.3( 60322)  UCG  8.7( 39546)  UAG  0.3(  1172)  UGG 14.5( 65630)

CUU 11.3( 51442)  CCU  7.2( 32678)  CAU 12.7( 57585)  CGU 20.7( 93997)
CUC 10.6( 48147)  CCC  5.4( 24383)  CAC  9.6( 43743)  CGC 21.1( 96053)
CUA  4.0( 19067)  CCA  8.5( 38663)  CAA 14.8( 67129)  CGA  3.7( 16607)
CUG 50.9(231373)  CCG 22.3(101467)  CAG 28.8(130898)  CGG  5.7( 25751)

AUU 29.9(135873)  ACU  9.5( 43256)  AAU 18.7( 84846)  AGU  9.1( 41544)
AUC 24.6(111878)  ACC 22.7(103121)  AAC 21.6( 98018)  AGC 15.6( 70867)
AUA  5.3( 24233)  ACA  7.9( 35995)  AAA 34.4(156169)  AGA  2.7( 12345)
AUG 27.2(123604)  ACG 14.0( 63696)  AAG 11.4( 51685)  AGG  1.6(  7423)

GUU 19.1( 86572)  GCU 16.2( 73677)  GAU 32.3(146794)  GGU 25.1(114185)
GUC 14.8( 67356)  GCC 25.0(113412)  GAC 19.3( 87759)  GGC 28.6(130043)
GUA 11.2( 51020)  GCA 20.6( 93390)  GAA 39.5(179460)  GGA  8.6( 39036)
GUG 25.5(115687)  GCG 32.2(146264)  GAG 18.5( 83804)  GGG 11.1( 50527)

Coding GC 51.37%  1st letter GC 58.50%  2nd letter GC 40.70%  3rd letter GC 54.90%



Bacillus anthracis [gbbct]: 180 CDS's (52031 codons)
Fields: [triplet] [frequency: per thousand] ([number])

UUU 33.5(  1745)  UCU 17.3(   902)  UAU 34.4(  1792)  UGU  6.1(   319)
UUC 10.2(   530)  UCC  5.3(   275)  UAC  9.4(   490)  UGC  2.1(   107)
UUA 44.2(  2301)  UCA 14.0(   730)  UAA  2.3(   118)  UGA  0.5(    24)
UUG 11.3(   589)  UCG  3.6(   188)  UAG  0.7(    37)  UGG  9.8(   511)

CUU 14.7(   763)  CCU 10.1(   525)  CAU 16.8(   873)  CGU 10.9(   567)
CUC  3.7(   195)  CCC  2.7(   141)  CAC  4.6(   239)  CGC  2.6(   137)
CUA 13.2(   686)  CCA 14.9(   773)  CAA 33.7(  1752)  CGA  6.8(   353)
CUG  4.7(   242)  CCG  4.6(   237)  CAG 10.4(   542)  CGG  1.8(    95)

AUU 44.6(  2322)  ACU 14.6(   761)  AAU 44.6(  2321)  AGU 16.5(   861)
AUC 11.8(   616)  ACC  5.2(   269)  AAC 13.7(   711)  AGC  5.1(   266)
AUA 24.9(  1295)  ACA 25.9(  1350)  AAA 69.5(  3614)  AGA 13.8(   720)
AUG 23.8(  1240)  ACG  8.1(   419)  AAG 23.5(  1223)  AGG  4.3(   226)

GUU 19.9(  1036)  GCU 17.9(   930)  GAU 39.7(  2066)  GGU 17.3(   900)
GUC  5.2(   268)  GCC  4.7(   244)  GAC  8.8(   456)  GGC  5.4(   279)
GUA 26.8(  1395)  GCA 22.6(  1178)  GAA 55.7(  2897)  GGA 20.2(  1049)
GUG  9.7(   507)  GCG  7.1(   366)  GAG 19.3(  1003)  GGG  8.9(   461)

Coding GC 33.59%  1st letter GC 44.51%  2nd letter GC 31.07%  3rd letter GC 25.20%
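
The two tables show how differently the organisms use synonymous codons: for lysine, for example, both favour AAA, but for leucine E. coli strongly favours CUG while B. anthracis favours UUA. One very simple way to use such a table, sketched below, is to back-translate a peptide with the most frequent E. coli codon for each residue. This is an illustrative rule only and is not presented as the procedure used to design SEQ ID NO 1; the dictionary covers just four amino acids, with frequencies copied from the E. coli table, and the function names are hypothetical.

```python
# Frequencies per thousand taken from the E. coli table, four amino acids only.
ECOLI_CODON_FREQ = {
    "K": {"AAA": 34.4, "AAG": 11.4},
    "E": {"GAA": 39.5, "GAG": 18.5},
    "V": {"GUU": 19.1, "GUC": 14.8, "GUA": 11.2, "GUG": 25.5},
    "L": {"UUA": 13.8, "UUG": 13.3, "CUU": 11.3,
          "CUC": 10.6, "CUA": 4.0, "CUG": 50.9},
}


def preferred_codon(amino_acid):
    """Return the most frequent E. coli codon for a one-letter amino acid."""
    try:
        codons = ECOLI_CODON_FREQ[amino_acid]
    except KeyError:
        raise ValueError(f"no codon data for {amino_acid!r}") from None
    return max(codons, key=codons.get)


def back_translate(peptide):
    """Back-translate a peptide using the preferred codon at every position."""
    return "".join(preferred_codon(residue) for residue in peptide.upper())


print(back_translate("EVKL"))
```

A practical design would normally weigh other factors as well, such as avoiding repeats or unwanted restriction sites, which this sketch ignores.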
1 AAGCTTCATA TGGAAGTAAA GCAAGAGAAC CGTCTGCXGA ACGAATCTGA ATCCAGCTCT
61 CAGGGCCTGC TTGGTTACTA TTTCTCTGAC CTGAACTTCC AAGCACCGAX GGTTGTAACC 
121 AGCTCTACCA CTGGCGATCT GTCCATCCCG TCTAGT6AAC TTGAGAACAT TCCAAGCGAG 
181 AACCAGTATT TCCAGTCTGC AATCTGGTCC GGTTTTATCA AAGTCAAGAA ATCTGAIGAA 
241 TACACGTTTG CCACCTCTGC TGATAACCAC GTAAGCATGT GGGTTGACGA 7CAGGAAGTG 
301 ATCAACAAAG CATCCAACTC CAACAAAATT CGTCTGGAAA AAGGCCGTCT GTATCACATC 
361 AAGATTCAGT ACCAACGCGA GAACCCGACT GAAAAAGGCC TGGACTTTAA ACTGTAXTGG 
421 ACTGATTCTC AGAACAAGAA AGAAGTGATC AGCTCTGACA ATCTGCAACT GCCGGAAXTG 
481 AAACAGAAAA GCTCCAACTC TCGTAAGAAA CGTTCCACCA GCGCTGGCCC GACCGTACCA 
541 GATCGCGACA ACGATGGTAT TCCGGACTCT CTGGAAGTTG AAGGCTACAC GGTTGATGTA 
601 AAGAACAAAC GTACCTTCCT TAGXCCGTGG ATCTCCAATA TTCACGAGAA GAAAGGTCTG 
661 ACCAAAIACA AATCCAGTCC GGAAAAATGG TCCACTGCAT CTGATCCGTA CTCTGACTTT 
721 GAGAAAGTGA CCGGTCGTAT CGACAAGAAC GTCTCTCCGG AAGCACGCCA TCCACTGGTT 
781 GCTGCGTATC CGATCGTACA TGTTGACATG GAAAACATCA TTTTGTCCAA GAACGAAGAC 
841 CAGTCCACTC AGAACACTGA CTCTGAAACT CGTACCA3PCT CCAAfiAACAC CTCCACGTCT 
901 CGTACTCACA CCAGTGAAGT ACATGGTAAC GCTGAAGTAC ACGCCTCTTT CTTTGACAffC 
961 GGCGGCTCTG TTAGCGCTGG CTTCTCCAAC TCTAATTCTT CTACTGMGC CATTGATCAC 
1021 TCTCTGAGTC TGGCTGGCGA ACGTACCTGG GCAGAGACCA TGGGTCTTAA CACTGCTGAT 
1081 ACCGCGCGTC TGAATGCTAA CA3TCGCTAC GTCAACACTG GTACGGCACC GAXCTACAAC 
1141 GTACTGCCAA CCACCAGCCT GGTTCTGGGT AAfiAACCAGA CTCTTGCGAC CATCAAAGCC 
1201 AAAGAGAACC AACTGTCTCA GATTCTGGCA CCGAATAACT ACTATCCTTC CAAGAACCTG 
1261 GCXCOSATCG CACTGAACGC ACAGGATGAC TTCTCTTCGA CTCCGATCAC CATGAACTAC 
1321 AACCAGTTCC TGGAACTTGA GAAGACCAAA CAGCTGCGTC TTGACACTGA CCAAGTGTAC 
1381 GGTAACATCG CGACCTACAA CTTTGAGAAC GGTCGCGTCC GCGTTGACAC AGGCTCTAAT 
1441 TGGTCTGAAG TACTGCCTCA GATTCAGGAA ACCACCGCTC GTATCAXCTT CAACGGTAAA 
1501 GACCTGAACC TGGTTGAACG TCGTATTGCT GCTGTGAACC CGTCTGATCC ASTAGAGACC 
1561 ACCAAACCGG ATATGACTCT GAAAGAAGCC CTGAAGATCG CCTTTGGCTT CAACGAGCCG 
1621 AACGCTAATC TTCAGTACCA AGGTAAAGAC ATCACTGAAT 1TGACTTCAA CTTTGATCAG 
1681 CAGACCTCTC AGAAXATCAA GAACCAACTG GCTGAGCTGA ACGCGACCAA TATCTATACG 
1741 GTACTCGACA AGATCAAACT GAACGCGAAA AIGAACATTC TGATTCGCGA CAAACGTTTC 
1801 CACTACGATC GTAATAACAT CGCTGTTGGC GCTGATGAAT CTGTXGTGAA AGAAGCGCAT 
1861 CGCGAAGTCA TCAACTCCAG CACCGAAGGC CTGCTTCTGA ACATCGACAA AGACATTCGT 
1921 AAGATCCTGT CTGGTTACAT TGTTGAGATC GAAGACACCG AAGGCCTGAA AGAAGTGATC 
1981 AATGATCGTT ACGACATGCT GAAGATCAGC TCTCTGCGTC AAGATGGTAA GACGTTCATT 
2041 GACTTGAAGA AATACAACGA CAAACTTCCG CTGTA01ATCT CTAATCCGAA CTACAAAGTG 
2101 AACGTTTACG CTGTTACCAA ASAGAACACC ATCATCAATC CATCTGAGAA CGGC6ATACC 
2161 TCTACCAACG GTATCAAGAA GATTCTGATC TTCTCCAAGA AAGGTTACGA GMCGGTTAA 
2221 TAGGATCC 



(SEQ ID NO 1) 
Figure 2 
1 EVKQENRLLfl ESESSSQGId. GYYFSDLNFQ AFMWTS3TT GDLSIPSSEB ENIPSENQYF
61 QSAIWSGFIK VKKSDEYTETi TSADNHVTMW VDDQEVINKA SNSNKIRIEK GRLYQIKIQY
121 QKENPTEKGL DFKLYWTDSQ NKKEVTSSDN LQLPELKQKS SNSRKKRSTS AGPTVPDRDN
181 DGIPDSLEVE GYTTOVKNKR TFMPWISNI HEKKGLTKYK 3SPEKWSTAS DPYSDFEKVT
241 GRIDKUVSPE ARHPLVAA



(Seq ID NO 3) 



1 gaagttaaac aggagaaccg gttattaaat gaatcagaat caagttccca ggggttacta 
61 ggatactatt ttagtgattt gaattttcaa gcacccatgg tggttacctc ttctactaca 
121 ggggatttat ctattcctag ttctgagtta gaaaatattc catcggaaaa ccaatatttt 
181 caatctgcta tttggtcagg atttatcaaa gttaagaaga gtgatgaata tacatttgct 
241 acttccgotg ataatcatgt aacaatgtgg gtagatgacc aagaagtgat taataaagct 
301 tctaattcta acaaaatcag attagaaaaa ggaagattat atcaaataaa aattcaatat 
361 caacgagaaa atcctactga aaaaggattg gatttcaagt tgtactggac cgattctcaa 
421 aataaaaaag aagtgatttc tagtgataac ttacaattgc cagaattaaa aeaaaaatct 
481 tcgaactcaa gaaaaaagcg aagtacaagt gctggaccta cggttccaga cogtgacaat 
541 gatggaatcc ctgattcatt agaggtagaa ggatatacgg ttgatgtcaa aaataaaaga 
601 acttttcttt caccatggat ttctaatatt catgaaaaga aaggattaac caaatttaaa 
661 toatctcctg aaaaatggag cacggcttct gatccgtaca gtgatttcga aaaggttaea 
721 ggacggattg ataagaatgt atcaccagag gcaagacacc cccttgtggc agct 



1 EVKQENRLLN ESE3SSQG1L GYYFSDLNFQ APMWTSSTT GDLSIPSSEL ENIPSENQYF 
61 QSMHSGFIK VKKSDEYTFA TSADNHVTMW VDDQEVTNKR SNSNKIRLEK GKLYQTKIQY 
121 QRENPTEKGL DFKLYWTDSQ NKKEVTSSDN LQLEELKQKS SNSRKKRSTS AGPTVPDRDN 
181 OG1PDSLEVE GYTVDVKNKR TFLSPWISNI HEKKGIiTKYK SSPEKWSTAS DPYSDFEKVT 
241 GRIDKNVSPE ARHPLVftAYP IVHVDMENII LSKNEPQSTQ NTDSETRTIS KNTSTSRTHT 
301 SEVHGNAfiVH ASFFDIGGSV SAGFSNSNSS TVAlDHSMIi AGERIWftETM GLNTADTARI 
361 NANXHYVOTG TAPIYNVLPT TSJVTLGKNQT LfcTIKAKENQ LSQIIAPNNY YPSKNIAPIA 
421 LNAQDDFSST PITMNYNQFL ELEKTKQXKi DTDQVYGNIA TYtiFENGRVR VDTGSNWSEV 
481 LPQIQET 



(SEQ ID No 4)



(SEQ ID NO 5) 
1 gaagttaaac aggagaaccg gttattaaat gaatcagaat caagttccca ggggttacta
61 ggatactatt ttagtgattt gaattttcaa gcacccatgg tggttacttc ttctactaca 
121 ggggatttat ctattcctag ttotgagtta gaaaatattc catcggaaaa ccaatatttt 
181 caatetgcta tttggtcagg atttatcaaa gttaagaaga gtgatgaata tacatttgct 
241 acttccgctg ataatcatgt aacaatgtgg gtagatgacc aagaagtgat taataaagct 
301 tctaattcta acaaaatcag attagaaaaa ggaagattat atcaaataaa aattcaatat 
361 caacgagaaa atcctaetga aaaaggattg gatttcaagt tgtactggac cgattctcaa 
421 aataaaaaag aagtgatttc tagtgataac ttacaactgc cagaattaaa acaaaaatct 
481 tcgaactcaa gaaaaaagcg aagtacaagt gctggaecta cggttccaga cegtgacaat 
541 gatggaatcc ctgattcatt agaggtagaa ggatatacgg ttgatgtcaa aaataaaaga 
601 acttttcttt caccatggat ttctaatatt catgaaaaga aaggattaac caaatataaa 
661 tcatetcctg aaaaatggag cacggcttct gatccgtaoa gtgatttcga aaaggttaea 
721 ggacggattg ataagaatgt atcaccagag gcaagacacc cccttgtggc agcttatccg 
7B1 attgtacatg tagatatgga gaatattatt ctctcaaaaa atgaggatca atccacacag 
841 aatactgata gtgaaacgag aacaataagt aaaaatactt etacaagtag gacacatact 
901 agtgaagtac atggaaatgc agaagtgcat gcgtcgttct ttgatattgg tgggagtgta 
961 tctgcaggat ttagtaattc gaattcaagt acggtcgcaa ttgatcattc actatctcta 
1021 gcaggggaaa gaacttgggc tgaaacaatg ggtttaaata ccgctgatac agcaagatta 
1081 aatgccaata ttagatatgt aaatactggg acggctccaa tctaeaacgt gttaccaacg 
1141 acttcgttag tgttaggaaa aaatcaaaca ctcgcgacaa ttaaagctaa ggaaaaccaa 
1201 ttaagtcaaa tacttgcacc taataattat tatccttcta aaaacttggc gccaatcgca 
1261 ttaaatgcac aagacgattt cagttctact ccaattacaa tgaattacaa tcaatttctt 
1321 gagttagaaa aaacgaaaca attaagatta gatacggatc aagtatatgg gaatatagca 
1391 acatacaatt ttgaaaatgg aagagtgagg gtggatacag gctcgaactg gagtgaagtg 
1441 ttaccgcaaa ttcaagaaac a 



(SEQ ID No 6)



1 SAGPTVPDRD NDGIPDSLEV EGYTVDVKNK RTFLSPWJSN JHEKB0SLTKX" KSSPEKttSTA 
61 SDPYSDFEKV TGRIDKNVSP EARHPLVAAY PIVHVDMENI U*SKNEDQST QNTDSETRTI 
121 SKNTSTSRTH TSEVHGNAEV BASFFDIGG3 VSAGFSNSNS STVAIDHSIiS LAfiERTWAET 
1B1 MGLNTADTAR LNANIRYVNT GTAPIYNVLP TTSXiVLGKNQ TLAJXK&KEN QLSQlLAPNN 
241 YYPSKNLAPI A1NAQDDFSS TPITMNYNQF LELEKTKQLR &MDQVYGNI ATYNFSNGKV 
301 RVDTGSNWSB VLPQIQET 



(SEQ ID No 7)



Figure 3 Cont. 
1 agtgctggac ctacggttcc agaccgtgac aatgatggaa tccctgattc attagaggta
61 gaaggatata cggttgatgt caaaaataaa agaacttttc tttcaccatg gatttctaat
121 attcatgaaa agaaaggatt aaccaaatat aaatcatctc ctgaaaaatg gagcacggct
181 tctgatccgt acagtgattt cgaaaaggtt acaggacgga ttgataagaa tgtatcacca
241 gaggcaagac acccccttgt ggcagettat ccgattgtac atgtagatat ggagaatatt
301 attctctcaa aaaatgagga tcaatccaca cagaatactg atagtgaaac gagaacaata
361 agtaaaaata cttctacaag taggacacat actagrgaag tacatggaaa tgcagaagtg
421 catgcgtcgt tctttgatat tggtgggagt gtatetgcag gatttagtaa ttcgaattca
481 agtacggtcg caattgatca ttcactatot ctageagggg aaagaacttg ggctgaaaca
541 atgggtttaa atacegctga tacagcaaga ttaaatgcca atattagata tgtaaatact
601 gggaeggctc caatctacaa egtgttacea acgacttcgt tagtgttagg aaaaaatcaa
661 acactcgega caattaaagc taaggaaaac caattaagtc aaatacttgc acctaataat
721 tattatcctt ctaaaaactt ggcgocaatc gcattaaatg cacaagacga tttcagttct
781 actccaatta caatgaatta caatcaattt cttgagttag aaaaaacgaa acaattaaga
841 ttagatacgg atcaagtata tgggaatata goaacataca attttgaaaa tggaagagtg
901 agggtggata caggctcgaa ctggagtgaa gtgttaccgc aaattcaaga aaca

(SEQ ID No 8)



1 SAGPTVPDRD NDGIPDSLEV EGYTVDVKNK RTFLSPWISN IHEKKGLTKY KS3PBKWSTA 
61 SDPYSDPEKV TGRIDKNVSP E&RHPLVAAY PIVHVDMENI IIiSKNSDQST QNTDSETRTI 
121 SKNTSTSRTH TSEVHGNAEV HASFFDlGGS VSAGFSNSNS pTVAIDHSLS LAGERTWAET 
181 MGLNTADTAR LNANIRXVNT GTAPIYNVLP TTSLVLGKNQ TIATIKAKEN QLSQIIAPNN 
241 YYPSKNLAFI AI«WAQPDFS$ TPITMMTOQF LELEKTKQLR LDTDQVYGNI ATYNFENGRV 
301 RVDTGSNWSE VLFQlQETTA RIIFNGKDLN LVBRRIAAVN P3DFLETTKP DMT1KBALKI 
361 ATOFNEPNGN LQYQGK0ITE FDFNFDQQIS QNIKNQLAEL ttAINIYTVLD KIKXNAKMNI 
421 LIRDKR 

(SEQ ID No 9) 



Figure 3. Cont. 
1 agtgctggac ctacggttcc agaccgtgac aatgatggaa tccctgattc attagaggta
61 gaaggatata cggttgatgt caaaaataaa agaacttttc tttcaccatg gatttctaat 
121 attcatgaaa agaaaggatt aaecaaatat aaatcatctc ctgaaaaatg gagcacggct 
181 tctgatccgt acagtgattt cgaaaaggtt acaggacgga ttgataagaa tgtatcacca 
241 gaggcaagac accceettgt ggcagcttat ccgattgxac atgtagatat ggagaatatt 
301 attctctcaa aaaatgagga tcaatccaea cagaataetg atagtgaaac gagaacaata 
361 agtaaaaata cttctaoaag taggacacat actagtgaag tacatggaaa tgcagaagtg 
421 catgcgtcgt tctttgatat tggtgggagt gtatctgqag gatttagtaa ttcgaattca 
481 agtacggtcg caattgatca ttcactatct ctagcagggg aaagaacttg ggctgaaaca 
541 atgggtttaa ataccgctga tacagcaaga ttaaatgcca atattagata tgtaaatact 
601 gggacggctc caatotacaa cgtgttacca acgacttcgt tagtgttagg aaaaaatcaa 
661 acactcgcga caattaaagc taaggaaaac caattaagtc aaatacttgc acctaataat 
721 tattatcctt otaaaaactt ggcgccaatc gcattaaatg cacaagacga tttcagttct 
7B1 actccaatta caatgaatta caatcaattt cttgagttag aaaaaacgaa acaattaaga 
841 ttagatacgg atcaagtata tgggaatata gcaacataca attttgaaaa tggaagagtg 
901 agggtggata oaggctcgaa ptggagtgaa gtgttaccgc aaattcaaga aacaactgca 
961 cgtatcattt ttaatggaaa agatttaaat ctggtagaaa ggcggatagc ggcggttaat 
1021 ecuagtgatc cattagaaac gactaaaccg gatatgacat taaaagaagc ccttaaaata 
1081 gcatttggat ttaacgaacc gaatggaaac ttacaatatc aagggaaaga cataaccgaa 
1141 tttgatttta atttcgatca acaaacatct caaaatatca agaatcagtt agcggaatta 
1201 aacgcaaota acatanatac tgtattagat aaaatcaaat taaatgcaaa aatgaatatt 
1261 ttaataagag ataaacgt 



(SEQ ID No 10) 



1 EVKQBNRL18 ESESSSQGLL GYYFSDLNFQ APMWTSSTT GDLSIPSSGI. ENIPSENQYT? 
61 QSAIWSGFIK VKKSDEYTFA TSADNHVTMW VDDQEVINKA SNSNKIKE.SK GKLYQIKIQY 
121 QRENPTEKGL DFKXYttTDSQ NKKEVIC33DH LQLPBLKQKS SNSRKKRSTS AGPTVPDRDN 
181 DGIPDSLEVE GYTVDVKNKR TFLSFWI3NI HEKKGLTKYK SSPEKWSTAS DPtfSDFEKVT 
241 (3RIDKNV3PE ARHPLVAAYP IVHVDMEN1I 1SKNEDQSTQ NTDSETRTIS KNTSTSRTHT 
301 SEVBGNAEVH ASFFDIGGSV SAG?SNSNSS TVAIDHSLSL AGERTWAE1M GIOTADTARL 
361 Nj&NIRYVNTG TAPIYNVLPT TSLTOGKNQT LATIKAKENQ LSQILAPNNY YPSKNLAPIA 
421 LNAQDDFSST PITMNYNQFX, ELEKTKQLKL DTDQVYGNIA TTOFENGRVR VOTGSNWSEV 
481 LPOIQETTAR IIFNGKDLNL VERRIAAVNP SDPLETTKPD MT1KEALKIA FGF^EPNGNIi 
541 QYQGKDITEF DFNFDQQTSQ NIKNQIAELN ATNIYTVLDK IIOHAKMNIL IRDKR 

(SEQ ID NO 11) 



Figure 3 Cont. 
1 gaagttaaac aggagaaccg gttattaaat gaatcagaat caagttccca ggggttacta
61 ggatactatt ttagtgattt gaattttcaa gcacccatgg tggttacctc ttctactaca 
3.21 ggggatttat ctattcrtag ttctgagtta gaaaatattc catcggaaaa ccaatatttt 
181 caatetgcta tttggtcagg atttatcaaa gttaagaaga gtgatgaata tacatttgct 
241 acttccgctg ataatcatgt aacaatgtgg gtagatgacc aagaagtgat -taataaagct 
301 tctaattcta acaaaatcag attagaaaaa ggaagattat atcaaataaa aattcaatat 
361 caacgagaaa atootactga aaaaggattg gatttcaagt tgtactggac cgattctcaa 
421 aataaaaaag aagtgatttc tagtgataac ttacaattgc cagaattaaa acaaaaatct 
4B1 tcgaactcaa gaaaaaagcg aagtacaagt gctggaccta cggttccaga ccgtgacaat 
541 gatggaatcc qtgattcatt agaggtagaa ggatatacgg ttgatgtcaa aaataaaaga 
601 acttttcttt caccatggat ttctaatatt catgaaaaga aaggattaac caaatataaa 
661 tcatctcctg aaaaatggag cacggcttet gatccgtaca gtgatttcga aaaggttaca 
721 ggaeggattg ataagaatgt atcaccagag gcaagacacc cccttgtggc agcttatccg 
781 attgtacatg tagatatgga gaatattatt ctctcaaaaa atgaggatca atccacacag 
841 aatactgata gtgaaacgag aacaataagt aaaaatactt ctacaagtag gacacataet 
901 agtgaagtac atggaaatgc agaagtgcat gcgtcgttct ttgatattgg tgggagtgta 
961 tctgcaggat ttagtaattc gaattcaagt acggtcgcaa ttgatcattc actatotcta 
3.021 gaaggggaaa gaacttgggc tgaaacaatg ggtttaaata ecgctgatac agcaagatta 
1081 aatgccaata ttagatatgt aaatactggg acggctccaa tctacaacgt gttaccaacg 
1141 acttcgttag tgttaggaaa aaatcaaaca etcgcgacaa ttaaagctaa ggaaaaccaa 
1201 ttaagtcaaa tacttgcacc taataattat tatecttcta aaaacttggc gocaatcgca 
1261 ttaaatgcac aagacgattt cagttctact ccaattacaa tgaattacaa tcaatttctt 
1321 gagttagaaa aaacgaaaca attaagatta gatacggatc aagtatatgg gaatatagca 
1381 acatacaatt ttgaaaatgg aagagtgagg gtggatacag gctcgaaotg gagtgaagtg 
1441 ttaccgcaaa ttcaagaaac aactgcacgt atcattttta atggaaaaga tttaaatctg 
1501 gtagaaaggc ggatagcggc ggttaatcct agtgatccat tagaaacgac taaaccggat 
1561 atgacattaa aagaagccct taaaatagca tttggattta acgaaccgaa tggaaactta 
1621 caatatcaag ggaaagacat aaccgaattt gattttaatt tcgatcaaca aacatctcaa 
1681 aatatcaaga atcagttagc ggaattaaac gcaactaaca tatatactgt attagataaa 
1741 atcaaattaa atgcaaaaat gaatatttta ataagagata aacgt 

(SEQ ID No 12) 



1 EVKQENKLLN ESESSSQGLL GYYFSDLNFQ ASMWTSSTT GDLSIPSSEL ENIPSENQYF 
61 QSAIWSGFIK VKKSDEYTFA TSADNHVTMW VDDQEVTNKA SNSNKIRLEK GRLYQlJQQY 
121 QKENPTEKGL DFKLYTODSQ NKKEVISSDN LQLfELKQKS SNSRKKRST9 AGPTVPDRDN 
181 DGIPDSLEVE GYTVDVKNKfc TFLSFWISNI HBKKGLTKYK SSPEKWSTAS DPYSDFEKV* 
241 GRIDKNVSPE ARUPLVAAYP IVKTOMEtfll LSKNEDQSTQ NTDSQTRTIS KNTSTSKTHT 
301 SEVHGNAEVH ASFFDIGGSV SAGFSNSNSS TVAIDBSWI, AGEMWfcETM GLNTADTARL 
361 MW3IRYVNTG TAPlYNVLPt TSLVLGKNQT LATIKAKENQ LSQILAPNNY YPSKNLAPIA 
421 LNAQDPFSST PITMNTOQFL ELEKTKQLBI. OTDQVYGttXA TYNFENGRVR VDTGSNWSEV 
481 LPQIQETTAR IIPNGKDtNL VERRlAAVNP SDPLETTKPD MTLKEALKXA FGFNEPNGNL 
541 QYQGKDITEF DFNFDQQTSQ NJKNQLAELN ATN1YTVLDK IKLNAKMNXL IRDKRFHYDR 
601 NMIAVGADES WKEAHREVl NSSTEGLLLN IDKDlRKIIiS GYIVEIEDTE GLKEVTNDRY 
661 DMLNISSLRQ DGKTFIDFKK YNDKLPLYTS NPttYKVNVYA VTKENTIINP SENGDTSTtfG 
721 IKK1LIFSKK GYEIG 

(SEQ ID No 13)
Figure 3 Cont. 
1 gaagttaaac aggagaaccg gttattaaat gaatcagaat caagttccca ggggttacta
61 ggatactatt ttagtgattt gaattfctcaa gcacccatgg tggttacctc ttctactaca 
121 ggggatttat ctattcctag ttctgagtta gaaaatattc catcggaaaa ccaatatttt 
181 caatctgcta tttggtcagg atttatcaaa gttaagaaga gtgatgaata tacatttgct 
241 acttccgctg ataatcatgt aacaatgtgg gtagatgacc aagaagtgat taataaagct 
301 tctaattcta acaaaatcag attagaaaaa ggaagattat atcaaataaa aattcaatat 
361 caacgagaaa atcctaetga aaaaggattg gatttcaagt tgtactggac cgattctcaa 
421 aataaaaaag aagtgatttc tagtgataac ttacaattgo cagaattaaa acaaaaatct 
481 tcgaactcaa gaaaaaagcg aagtacaagt gctggaccta cggttccaga ccgtgacaat 
541 gatggaatcc ctgattcatt agaggtagaa ggatatacgg ttgatgtcaa aaataaaaga 
601 acttttcttt caceatggat ttctaatatt catgaaaaga aaggattaac caaatataaa 
661 tcatctcctg aaaaatggag oaoggcttct gatccgtaca gtgatttcga aaaggttaca 
721 ggacggattg ataagaatgt atcaccagag gcaagacacc cccttgtggc agcttatccg 
781 attgtacatg tagatatgga gaatattatt ctctcaaaaa atgaggatca atccacacag 
841 aatactgata gtgaaacgag aacaataagt aaaaatactt ctacaagtag gacacatact 
901 agtgaagtac atggaaatgc agaagtgcat gcgtcgttct ttgatattgg tgggagtgta 
961 tctgcaggat ttagtaattc gaattcaagt acggtcgcaa ttgatcattc actatctcta 
1021 gcaggggaaa gaacttgggc tgaaacaatg ggtttaaata ccgctgatac agcaagatta 
1081 aatgccaata ttagatatgt aaatactggg acggctccaa tctacaacgt gttaccaacg 
1141 acttcgttag tgttaggaaa aaatcaaaca ctcgcgacaa ttaaagctaa ggaaaaccaa 
1201 ttaagtcaaa tacttgcacc taataattat tatccttcta aaaacttggc gccaatcgca 
1261 ttaaatgcac aagacgattt cagttctact ccaattacaa tgaattacaa tcaatttctt 
1321 gagttagaaa aaacgaaaca attaagatta gatacggatc aagtatatgg gaatatagca 
1381 acatacaatt ttgaaaatgg aagagtgagg gtggatacag gctcgaactg gagtgaagtg 
1441 ttaccgcaaa ttcaagaaac aactgcacgt atcattttta atggaaaaga tttaaatctg 
1501 gtagaaaggc ggatagcggc ggttaatcct agtgatccat tagaaacgac taaaccggat 
1561 atgacattaa aagaagccct taaaatagca tttggattta acgaaccgaa tggaaactta 
1621 caatatcaag ggaaagacat aaccgaattt gattttaatt tcgatcaaca aacatctcaa 
1681 aatatcaaga atcagttagc ggaattaaac gcaactaaca tatatactgt attagataaa 
1741 atcaaattaa atgcaaaaat gaatatttta ataagagata aacgttttca ttatgataga 
1801 aataacatag cagttggggc ggatgagtca gtagttaagg aggctcatag agaagtaatt 
1861 aattcgtcaa cagagggatt attgttaaat attgataagg atataagaaa aatattatca 
1921 ggttatattg tagaaattga agatactgaa gggcttaaag aagttataaa tgacagatat 
1981 gatatgttga atatttctag tttacggcaa gatggaaaaa catttataga ttttaaaaaa 
2041 tataatgata aattaccgtt atatataagt aatcccaatt ataaggtaaa tgtatatgct 
2101 gttactaaag aaaacactat tattaatcct agtgagaatg gggatattag taccaacggg 
2161 atcaagaaaa ttttaatctt ttctaaaaaa ggctatgaga taggataa 

(SEQ ID NO 14) 



Figure 3 Cont. 
1 FHYDRNNIAV GADESVVKEA HREVINSSTE GLLLNIDKDI RKILSGYIVE IEDTEGLKEV 
61 INDRYDMLNI SSLRQDGKTF IDFKKYNDKL PLYISNPNYK VNVYAVTKEN TIINPSENGD 
121 TSTNGIKKIL IFSKKGYEIG 

(SEQ ID No 15) 



1 tttcattatg atagaaataa catagcagtt ggggcggatg agtcagtagt taaggaggct 
61 catagagaag taattaattc gtcaacagag ggattattgt taaatattga taaggatata 
121 agaaaaatat tatcaggtta tattgtagaa attgaagata ctgaagggct taaagaagtt 
181 ataaatgaca gatatgatat gttgaatatt tctagtttac ggcaagatgg aaaaacattt 
241 atagatttta aaaaatataa tgataaatta ccgttatata taagtaatcc caattataag 
301 gtaaatgtat atgctgttac taaagaaaac actattatta atcctagtga gaatggggat 
361 actagtacca acgggatcaa gaaaatttta atcttttcta aaaaaggcta tgagatagga 
421 taa 

(SEQ ID No 16) 



Figure 4 